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NOTE  ON  THE  PROPORTION  OF  GENETIC  DEVIATES 
IN  THE  TAILS  OF  A  NORMAL  POPUIATION 

D*  S,  Robson  ^and  LeRoy  Powers  ^ 

Introduction 

The  phenotypic  array  exhibited  by  a  segregating  genetic  population 
reflects  both  the  genetic  and  the  environmental  variability  within  the 
population.  As  a  consequence,  an  element  of  uncertainty  attaches  to 
selection  for  genetically  superior  individuals  on  the  basis  of  their 
phenotypic  traits.  The  latter  may,  by  chance,  be  merely  the  result  of 
an  unusTially  favorable  environment  acting  upon  a  genotype  which  under 
less  favorable  conditions  would  display  only  a  mediocre  or  even  undesirable 
phenotype.  Chances  for  the  occurence  of  such  phenotypic  deception  depend, 
of  course,  upon  the  magnitude  of  the  environmentally  induced  variability 
as  compared  to  that  due  to  genetic  differences. 

Any  mathematical  formulation  of  this  problem  to  allow  the  geneticist 
to  numerically  evaluate  his  chances  for  successful  selection  requires  a 
detailed  description  of  the  phenotypic  frequency  distribution  in  the  pop¬ 
ulation,  Structurally,  the  total  segregating  population  may  be  regarded 
as  a  mixture  of  subpopulations,  with  each  subpopulatlon  representing  the 
distribution  of  phenotypes  produced  by  a  single  genotype  under  the  existing 
raxige  of  environmental  conditions,  smd  with  each  subpopulation  or  genotype 
contributing  to  the  total  population  in  proportion  to  its  genotypic  fre¬ 
quency.  A  mathematical  description  of  the  population  therefcare  consists 
of  specifying  the  relative  frequency  of  each  genotype  and  the  exact  form 
of  its  sissociated  distribution  of  phenotypes. 

Under  most  circumstances  where  selection  is  practiced  for  economic 
purposes  a  large  number  of  both  genetic  and  environmental  factors  operate 
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at  variable  levels  to  determine  the  phenotypes  appearing  in  the  population. 
Empirical  evidence  supports  the  belief  that  in  this  case  the  total  frequency 
distribution  and  also  the  component  distributions  for  a  quantitative  pheno¬ 
typic  trait  are  approximately  Gaussian  in  form*  A  standard  population  model 
which  has  therefore  come  into  use  in  such  problems  as  the  prediction  of  ad¬ 
vancement  under  selection  is  Eisenhart’s  Model  II  (19^7)  representing,  in 
the  simplest  case,  a  normal  mixture  of  normal  subpopulations  with  constant 
variance.  Each  genotype  is  assumed  to  generate  a  normal  distribution  of 
phenotypes  under  the  existing  range  of  environmental  conditions,  and  the 
distribution  of  phenotypic  means  (called  genotypic  values)  is  likewise 
normal* 


Graphs  of  the  Proportion  of  Genetic  Deviates 

The  phenotypic  value  X  for  some  quantitative  trait  of  an  individual 
selected  at  random  from  a  genetic  population  may  be  regarded  conceptually 
os  the  sum  of  two  components, 

G  »  average  phenotype  for  the  genotype  of  the  chosen  individual 

E  a  deviation  of  the  particular  phenotype  of  the  chosen  individual 
from  the  average  phenotype  (6)  for  the  genotype  of  that 
Individual  =  X-G 
or 

X  «  G  +  E 

The  first  component  G  is  conventionally  called  the  genotypic  value  and  E 
is  the  environmental  effect.  If  the  population  structure  is  a  normal 
mixture  of  normal  subpopulations  having  a  common  environmental  variance 
then  the  chance  variables  G  and  E  follow  independent  normal  distributions, 
G  having  a  mean  value  of  g  and  variance  ,  E  having  a  mean  value  of  zero 
and  variance  of  ,  so  that  X  itself  folloi/s  a  normal  distribution  with  mean 
g  and  variance  of  +  af  , 

A  probability  distribution  of  particular  interest  to  the  geneticist  is 
the  a  posteriori  distribution  of  G  among  individuals  of  a  fixed  pheno¬ 
type  xf  that  is,  given  that  he  has  selected  an  individual  of  phenotype 
X,  the  geneticist  is  then  concerned  with  the  probability  that  this  chosen 
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indivldtial  is  of  sujjerlor  genotype,  say  greater  thein  some  standard  value 
g*  For  any  selected  phenotypic  value  x  the  conditional  distribution  of 
genotypic  values  G  is  in  this  case  normal  with  mean  g  +  h  (x-g)  and 
variance  (l-h),  where  h  is  the  herltabllity  ratio 


gS  4.  (j2 

®  s 

or  the  ratio  of  genetic  to  total  variance  in  the  population* 

The  desired  probability  that  the  genotypic  value  will  exceed  some  specified 
value  g  is  therefore  given  by  the  standard  cumulative  normal  probability 
S  (lg+h(x**’g)-g]  /  Oj/l-^h)  or,  expressing  x  and  g  in  standard  units  as 


Vof  +  and  g’ 


we  obtain  the  simplified  form 


- 

JT^  t= 


5(Cjc'  /h-g']  /  ^ 


which  depends  on  the  variance  components  only  through  the  herltabllity 
ratio.  A  special  case  of  some  interest  is  where  g  <="&>  giving 

the  probability  that  an  individual  of  phenotype  x  will  be 
genetically  above  aver age j  note  that  when  h  =  ^  this  reduces  to 

S(x’)  =5  area  to  the  left  of  x’  under  the  standard  normal  curve. 

For  example,  with  x’  =  1.645  and  h  =  ^,  the  probability  that  the  genotypic 
value  of  the  selected  individual  exceeds  the  population  mean  is 
S (1.645)  =  ,95j  if  h  were  J  instead  of  then  the  desired  probability  would 
be  2(1.645/  /5)  =  S(.950)  =  .83,  while  h  »  ^/k  would  give 
5(1.645  /^)  =  5(2.849)  =  .998.  These  answers  may  be  interpolated  from 
Figure  1  where  the  general  form  of  the  solution  is  illustrated  by  plotting 
5([x’  y£-g*]  /  /1-b)  as  a  function  of  g*  with  x*  fixed  at  the  upper  5  per¬ 
cent  tail  value  of  1.645  and  with  h  taking  the  values  0.1  to  0.9  by  steps 
of  0.1.  Figures  2  and  3  display  the  corresponding  curves  with  x^  fixed  at 
the  upper  10  percent  value  of  1.282  and  the  upper  quartile  value  of  .674, 
respectively. 
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Where  selection  operates  on  the  entire  upper  tail  of  the  phenotypic  dis¬ 
tribution,  talcing  all  individueils  whose  phenotypic  value  exceeds  x,  the 
question  of  interest  becomes  the  proportion  of  these  selected  Individuals 
having  genotypic  values  exceeding  g*  More  precisely,  this  proportion  may 
be  interpreted  as  the  conditional  probability  that  a  randomly  selected 
individual  exhibiting  a  phenotypic  value  greater  than  x  will  also  have  a 
genotypic  value  greater  than  g.  Again,  the  solution  to  this  problem  is 
expressible  in  integral  form  as 

X  ’ 

though  in  this  case  the  integral  is  not  previously  tabulated.  The  only 
non-trlvial  case  which  permits  further  analytic  reduction  is  g*  =  0  and 
h  =  when  half  of  the  total  variance  is  genetic  then  the  probability 
that  an  individual  randomly  selected  from  the  region  X>x  will  be  geno¬ 
typically  above  average  is  [1  +  S(x')]  /8»  Thus,  in  this  case,  the  solution 
may  be  read  directly  from  the  cumulative  normal  tables^  for  example,  the 
proportion  of  above  average  genotypes  in  the  upper  5  percent  tail  of  the 
phenotypic  distribution  is  then 

[1  +  «(1.645.'I]  /2  =  [1  +  .95]  /2  =  .975. 

A  numerical  solution  in  the  general  case  can  be  obtained  only  by  numerical 
integration,  for  which  the  most  convenient  approach  is  to  hold  g’  and  h 
fixed  and  allow  x’  to  vary.  Graphs  of  this  probability  function  of  g» 
appearing  in  Figures  4-12  are  therefore  plotted  as  functions  of  x-,  with 
each  curve  corresponding  to  a  fixed  value  of  g’  and  h.  For  example,  from 
the  curve  for  P, i  =  .05  (g'  =  1.645)  and  h  =  ^  we  see  that  in  the  upper 
25  percent  tail  of  the  phenotypic  distribution  (x’  =  .674)  the  proportion 
of  standardized  genetic  deviates  exceeding  g*  =  lo645  is  .I??  in  other 
words,  17  percent  of  the  phenotypic  top  25  percent  also  belong  to  the 
genotypic  top  5  percent  of  the  population  when  half  of  the  variability  is 
genetic. 

A  more  complete  picture  of  the  expected  proportior^s  of  genetic  de¬ 
viates  may  he  constructed  in  histogram  form  by  a  slight  extension  of  the 
results  in  Figures  4-12.  For  a  set  of  phenotypic  intervals 
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(jCj  tXg  (X^  “)  on  the  standard  scale 

Xt"S 

/cg^  +  ce^ 

the  proportion  P, (x, of  genotypic  values  exceeding  the  population 
mean  by  an  amount  gc,  can  be  computed  from  the  formula 

P  fx  X  1 

»X(  +1 ;  - - - - — — - - - - 

[i-a(xj)]  -  [i-5(x,^.j)] 

Again  referring  to  Figure  8  for  the  case  h  =  0.5,  we  illustrate  this 
procedure  with  intervals  of  length  x,^,.x,  =  0.5  between  x,  -  .25  and 
X,  =  xi,  =  +2.25.  Taking  the  Interval  (1.25,  1.75),  for  example,  we  find 
[either  by  visual  interpolation  on  the  right  hand  scale  of  Figure  8  or 
directly  from  tables  of  the  standard  cumulative  normal  distribution) 

i-a(i.25)  =  .1056  1-5(1.75)  =  .o4oi 

[1-5(1.25)]  '  [1-5(1.75)3  =  .0655 

Then  to  find  the  expected  proportion  of  above  average  genotypes  in  this 
interval  we  refer  to  the  P  =  0.50  (or  g  =  0)  curve  in  Figure  8,  at  x,  .  1. 


and  at  x,+i  =  1,75, 

^***l+X  ^  ^0.1.76  *980 

SO  that 

)  =  Pq  (1.25,1.75)  =  ( .1056 ) - , 980 ( .ohoi ) 

»0655 

.0607 
"  To555 

=  .9267 

Thus,  when  h  =  6.55  percent  of  the  population  falls  in  the  phenotypic 

interval 

1-25  <1.75 

/“f  ^ 


25 
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and  92*67  percent  of  the  individuals  falling  in  this  interval  axe 
genotypically  above  average. 

Similarly,  from  the  curve  for  P  a  0.25  (g  =  *67^5)  in  Figure  8  we 

find 


^***1  ”  ^,6745  .1  .36  “  •'^56 

so  that 

=  2^3745(1.25^1*75) 


“  ^,6748*1,76  “ 

.756(.1056).»c867(.0»^01) 

.0655 


8 


.0451 

To5% 


=  .6885 


Thus,  68.85  percent  of  the  individuals  in  this  interval  belong  to  the  top 
quartile  of  the  genotypic  distribution.  By  subtraction,  92.67  -  68.85  = 
25.82  percent  of  the  Individuals  in  this  interval  belong  to  the  second 
quartile  of  the  genotypic  distribution.  The  remaining  computations  for 
this  and  other  Intervals  were  performed  in  the  same  manner  and  are  shown 
in  Table  1.  The  resulting  histogram  is  plotted  in  Figure  13,  indicating 
for  each  phenotypic  class  the  proportions  of  individuals  belonging  to 
the  various  percentiles  of  the  genotypic  distribution. 


Table  1  Percentage  of  individuals  in  a  phenotypic  class  which  belong  to 
the  top  P,  percent  of  the  genotypic  distribution  when  h  =  ^» 

Phenotypic  Phenotypic  Percentage  of  the  class  frequency  belonging  to  the 

class  frequency  top  percent  of  the  genotypic  distribution 


interval 

(percent) 

P,=»  50 

P,=  25 

P,=  10 

P,«  5 

.25  to  .75 

17.47 

68.63 

51.54 

9.56 

5.43 

0.29 

.75  to  1.25 

12,10 

fl509 

50.17 

20.58 

9.09 

1.07 

1.25  to  1.75 

6.55 

92.67 

68.85 

56.79 

19.85 

3.51 

1.75  to  2.25 

2c79 

97.49 

83.51 

55.91 

55.84 

9.32 

2.25  to 

1.22 

99.18 

94.26 

77.05 

59.84 

25. 4l 
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Tbe  Effect  of  Population  Size;  Before  Selection 

The  preceding  results  describe  some  characteristics  of  an  abstract 
infinite  population,  while  the  genetic  population  actually  observed  and 
selected  from  is  of  finite  size  N,  representing  only  a  san^le  frem  this 
potential  infinite  population.  Finiteness  of  the  observed  population  has 
no  effect  on  the  results  plotted  in  Figures  1  -  5>  which  are  conditional 
on  a  single  selected  phenotypic  value;  but  the  application  of  Figures  4  - 
12,  which  are  conditional  on  a  selected  upper  tail  of  the  phenotypic  dis¬ 
tribution,  requires  further  explanation  in  the  finite  case. 

A  selection  procedure  which  takes  all  Individuals  in  the  tail  of  a 
distribution  may  be  defined  in  essentially  two  different  ways,  by  specify¬ 
ing  either  (i)  the  minimum  acceptable  phenotypic  value  or  (li)  the  per¬ 
centile  of  the  observed  i>henotyplc  distribution  at  which  selection  begins. 
\Jhen  the  minimum  acceptable  phenotype  is  fixed  in  advance  then  the  number 
or  percentage  of  the  prospective  population  of  size  N  which  will  be  selected 
is  a  cliance  variable,  unknown  in  advance  of  tlie  selection  experiment,  while 
if  the  selection  rate  or  percentage  is  fixed  then  the  minimum  phenotypic 
value  which  id.ll  be  accepted  is  a  chance  variable  and  unknown  in  advance 
of  the  experiment.  In  either  case,  chance  variations  in  the  respective 
unknoim  quantities  will  decrease  as  H  is  increased,  and  as  N  approaches 
infinity  the  two  procedures  become  equivalent;  that  is,  when  population 
size  is  infinite,  the  specification  that  selection  will  take  all  individuals 
with  phenotypic  values  exceeding  x  is  equivalent  to  the  specification  that 
selection  will  take  a  fraction  1-5 (x)  from  the  upper  tail  of  the  phenotypic 
distribution.  This  asymptotic  equivalence  is  expressed  in  Figures  4-12 
by  labeling  the  ordinate  with  both  the  scale  of  x  and  1-5 (x). 

For  the  purposes  of  planning  a  finite  selection  experiment  of  eitlier 
type,  the  potential  magnitude  of  chance  fluctuations  in  the  respective 
unknown  quantities  must  he  considered.  For  example,  if  a  type  (i)  ex¬ 
periment  is  conten5)lated  with  the  minimum  acceptable  phenotype  at  some 
preassigned  level  x  then  rational  planning  requires  that  the  population 
size  N  be  chosen  large  enough  to  provide  reasonable  assurance  that  at 
least  one  of  the  N  phenotypes  will  exceed  x.  The  preassigned  value  of  x 
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in  this  case  is  presxjinably  hasecL  on  considerations  of  the  facts  reveeded 
in  Figures  1-12,  and  is  likewise  chosen  to  provide  reasonable  assurance 
that  an  individual  of  this  phenotype  will  be  of  superior  genotype. 

Analysis  of  the  type  (l)  experiment  model  shows  that  the  probability 
of  finding  at  least  one  phenotype  exceeding  x  =  (X-g)  /  Jaf  +  of  in  a 
population  of  size  N  is  1  -  [5(x)]",  and  the  probability  of  finding  one 
whose  genotype  also  exceeds  some  specified  value  g  »  CG-g)/aj  isl- 
[1-P,  (l-S(x))]N  .  Thus,  in  order  to  obtain  100(l-<if)  percent  assurance 

that  the  type  (i)  es^eriment  will  produce  at  least  one  selection,  population 
size  N  must  be  chosen  to  satisfy  the  equation 

1  -  [5(x)]N  B  1  -  a 
or 

N  = 

log  a(x) 

In  order  to  provide  the  same  assurance  of  obtaining  at  least  one  selection 
with  a  genotypic  value  exceeding  any  specified  value  g,  a  larger  value  of  K 
is  required, 

«  _ log  g _ 

^  “  Log[i-p,rxVa5(x))] 

For  example,  if  all  phenotypes  greater  than  two  standard  deviations  above 
the  mean  are  to  be  selected  (x  =  2,5(x)  =  .9775)  then  in  order  to  provide 
SO  percent  assurance  (a  =  O.l)  of  obtaining  at  least  one  selection  from 
the  top  5  percent  of  the  genotypic  distribution  (P^  =  .05,  g  =  1*645)  the 
population  size  N  must  be  chosen  as  follows 


h 

.* 

N 

0.1 

.1744 

580 

0.2 

.2597 

590 

0.5 

.5*^27 

295 

0.4 

.4270 

255 

0.5 

.5146 

196 

0.6 

.6071 

165 

0.7 

,7062 

145 

0.8 

.8159 

124 

0.9 

.9295 

108 

1.0 

1 

101 
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The  values  of  P,  in  these  computations  were  those  obtained  by  numerical 
integration#  but  may  be  read  with  two-digit  accuracy  from  Figures  4  -  12# 
respectively.  As  an  illustration,  referring  to  Figure  8  with  h  =  0.5 
we  find  that  the  c\irve  for  P,  =  ,05  intersects  with  x  =  2  at  Pj  ^  *515, 
giving 

logio  1 

^  “  log^o  C1-«0227(.515)]  “  lo'gjo  0.9683  “  Toosil  " 

A  more  detailed  characterization  of  a  contemplated  type  (i)  ex¬ 
periment  is  given  by  the  probability  of  selecting  exactly  m  lndivldua3.s 
belonging  to  the  top  100  P^  percent  of  the  genotypic  distribution,  which 
is  the  binomial  probability 

(^)  P,2,Cl-ai(x)]“  Cl-P,.^(l-5(x))]N-- 
or  approximately 

[NP,  (l-5(x)]«  -HP,  (l-5(x)) 
ml  ■  ® 

Thus,  for  the  case  illustrated  above,  with  W  =  196  the  probability  of 
obtaining  exactly  one  selection  from  the  genetic  top  5  percent  is 

(.0227) (.515) (.9883)^^^  2.291558  (.1008)  »  .231 

or  by  the  approximation 

2.291558  (.1011)  =  .232 

Similarly,  the  chance  of  obtaining  exactly  two  selections  of  this  kind  is 
approximately 

(.1011)  =  .265 

while  for  m  =  3 

(.1011)  =  .205 

and  for  m  =  4 


(.1011)  -  .116 
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and  so  on,  the  probabilities  fran  m  =  1  onward  adding  to  the  previously 
specified  1-a  =  •90»  The  expected  nupiber  of  such  selections  is 
HPj,,  (l-a(x)),  or  in  this  case  2.291338. 

If  a  type  (li)  experiment  is  contemplated  v;ith  a  fixed  selection  rate 
of  100  (1-2)  percent  then  the  population  size  N  should  be  chosen  large 
enough  to  provide  reasonable  assxtrance  (l-a)  that  the  k  =  N(l«a)  selections 
are  genetically  superior,  or  at  least  that  the  best  of  these  selected 
phenotypes  is  a  genetically  superior  individual.  A  characterization  of 
the  type  (il)  model  by  means  of  the  probability  distribution  of  m,  the 
nuniber  of  selectioris  exceeding  g  in  genotypic  value,  Is  readily  accom¬ 
plished  analytically,  but  the  form  of  the  distribution  is  quite  cumber¬ 
some  computationally.  A  more  convenient  characterization  is  given  by  the 
genotypic  distribution  associated  with  the  minimum  or  with  the  maximum 
selected  phenotype,  the  latter  being  equivalent  to  the  distribution  of  m 
for  the  special  case  k  =  ll(l-$)  =  1. 

The  genotypic  distribution  associated  with  the  k*th  ranking  phenotype 
in  a  population  of  size  N  is 

CO 

1-p,^,  «  J  Ci-s(x)?  [a(x)J'-r 

where  ^  denotes  the  probability  that  the  genotype  of  the  phenotypically 
k*th  largest  individual  will  exceed  the  population  mean  by  at  least  an 
amount  gcr^.  For  k  =  1,  or  for  -the  largest  of  N  phenotypes,  this  becomes 


CD 


dSN  (x) 


Numerical  evaluation  of  the  integral  P^^^  for  the  purpose  of  appraising 
a  contemplated  tjpe  (ii)  experimental  plan  is  still  somewhat  tedious,  but 
arbitrarily  close  bounds  on  the  integral  may  be  computed  as 


t  &  ( 
1=0 


Cj-g  A 


J  [I'-k  (Xj+i  )-Fj,  (Xj  )]  <  p,^k  < 


where  -oo  =  3c^<Xj<  ...  <x,  <  =  +  cd  is  any  Judiciously  chosen  raontone 

sequence  and 


-11- 


F^(x)  =  (r)Cl-3(x)]'  [5  (x)]N"r 

rsaO  ^ 

Furthermore,  when  k  is  small  and  N  is  large,  F^(x)  remains  extremely  small 
until  5(x)  gets  near  unity,  and  then  F^  (x)  is  closely  approximated  by 

F,U)  i  e  ‘S' 

peO 

In  particular,  when  k  =  1, 

F^(x)  =  aN(x)  e 

remains  less  than  .005  until  5(x)  attains  the  value 

and  increases  to  .995  at 

*(x.)  =  1 

thus  indicating  a  judicious  range  for  the  sequence  Xj  <  ...  <  x,. 

Certain  special  cases  do  exist  where  the  integred  P,^jt  can  be 
evaluated  explicitly,  the  most  interesting  being  the  case  h  «  ^  and  g  «  0. 
V/hen  half  of  the  phenotypic  variability  in  the  (infinite)  population  is  of 
genetic  origin  then  the  probability  that  the  k’th  ranking  phenotype  in  a 
population  of  size  N  will  be  genetically  above  average  is 

T,  ,  k 

From  this  result  it  follows  that  the  expected  number  of  above  average 
genotypes  among  the  best  k  out  of  N  phenotypes  is 


A  second  but  rather  trivial  special  case  arises  when  h  *  1,  giving 

P,,.  =  i  -  Fk(g) 

which  is  readily  evaluated  from  tables  of  the  standard  normal  distribution. 
While  this  limiting  case  is  of  no  particulai*  genetic  Interest  It  does 
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provide  a  convenient  bound  which  may  be  useful  as  a  check  against  numerical 
integration  at  other  values  of  h« 

Numerical  integration  employing  the  devices  mentioned  above  was 
carried  out  for  the  case  k  =  1  to  obtain  a  solution  to  «  0*9  as  an 

equation  in  N.  These  results^  presented  in  Table  1  as  a  guide  to  ex¬ 
periment  planning,  indicate  the  population  size  required  for  90  percent 
confidence  that  the  first  ranking  selection  will  belong  to  the  top  5#  10, 

25  or  50  percent  of  the  genetic  population «, 


Table  2  Population  size  required  for  90  percent  certainty  that  the 
genotypic  value  of  the  phenotypically  best  Individual  will 
fall  in  a  specified  upper  percentile  of  the  genotypic 
distribution* 


Herl^bility 

h  =  - 

af  +  of 


Percentile  of  the  genotypic  value  of  the  best  phenotype 
Top  Top  Top  Top 

50^  25^  1056  5^ 


0.1 

12,137 

7  X  10^ 

2  X  lOl* 

101  e 

0.2 

14? 

31,i40 

3  X  ICP* 

5  X  109 

0.3 

35 

1,070 

84,145 

2  X  ICP 

0.4 

16 

193 

4,650 

5  X  10* 

0.5 

9 

80 

870 

5,300 

0.6 

7 

34 

254 

1,060 

0.7 

6 

17 

109 

415 

0.8 

5 

l4 

58 

165 

0.9 

4 

11 

34 

82 

1.0 

4 

8 

22 

45 

The  Effect  of  Population  :jize;  After  Selection 

At  the  planning  stage  of  a  selection  experiment  the  phenotypes  to  be 
selected  are  yet  unknown,  except  in  the  fonn  of  a  lower  bound  in  the  case  of 
a  type  (i)  experiment,  and  upon  completion  of  the  phenotypic  selection  the 
genotypic  values  of  the  chosen  individuals  are  yet  unlaiown.  Figures  1-12 
are  therefore  still  of  some  interest  to  the  geneticist  at  the  post-s election 
stage  and,  in  particular.  Figures  l-J  apply  to  describe  the  genotypic 


distribution  associated  with  any  selected  phenotypic  value,  regardless  of 
the  rank  of  that  individual  among  the  selectiora.  If  phenotypic  values  are 
not  actually  measured  in  the  selection  e:^eriment  but  only  congpared  to  a 
standard  value  x  in  a  type  (i)  experiment  (i.e»  phenotypes  are  observed 
only  to  be  greater  or  less  than  x)  or  simply  ranked  in  a  type  (ii)  ex¬ 
periment  then  Figures  1-3  do  not  apply,  and  the  use  of  Figures  4-12  depends 
more  specifically  on  the  type  of  information  available  on  the  selected 
phenotypes . 

In  a  type  (i)  experiment  where  no  information  is  obtained  other  than 
the  exact  number  k  of  phenotypes  exceeding  the  standard  value  x  then  for 
a  random  one  of  these  k  individuals  the  computations  of  Figures  4—12  apply 
directly,  and  for  all  k  individuals  a  binomial  distribution  applies*  The 
probability  that  m  of  the  selected  k  individuals  feiH  In  the  top  100 
percent  of  the  genotypic  distribution  is  simply 

C) 


For  example,  if  k  =  10  individuals  are  found  to  he  phenotypically 
larger  than  a  preassigned  standard  value  of  x  =  1.645  in  a  population  with 
heritability  h  =  0.4  then  the  probability  that  at  least  3  of  these  10 
belong  to  the  top  100  P,  =  5  percent  of  the  genotypic  distribution  is 


(l-P,.,  (l-P.  ..  )»-(2”)P.=.  (l-F,  .. 


10  > 


.X 


From  Figure  7  and  the  curve  =  5  at  x  =  1.645  we  find  «  •353»  giving 


1  -  .6671®  -  10(.333)(,667)®  -  (.333)®(«667)®  =  .6996 


Notice  that  knowledge  of  k  will,  on  the  average,  decrease  the  variance  of 
m  by  a  factor  of  (l-P, ,, )/[l-P, .x+P, ,«3(x)],  or  approximately  by  l-P^  ♦ 
Additional  information  concerning  the  ordering  among  the  k  selected 
phenotypes  in  a  type  (i)  experiment  is  difficult  to  analyze  numerically 
because  of  integration  problems.  The  genotypic  distribution  associated 
with  the  lowest  ranking  selected  phenotype,  for  example,  is  given  by  the 
integreil 


k 

[1-5 (x)] 


k 
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while  that  for  liie  highest  ranking  phenotype  is 

CO 

^  J  ®  C«(y)-2(x)3>‘-id$(y) 

X 

Svich  functions  could  be  integrated  numerically  by  the  methods  employed  in 
computing  Table  1;  however,  the  number  of  cases  to  be  considered  is  quite 
large,  requiring  extensive  tables  or  graphs. 

For  the  purposes  of  reconciling  the  population  sizes  for  a  type  (ii)  ex¬ 
periment  tabxilated  in  the  =  .05  column  of  Table  2  with  those  computed 
earlier  for  a  type  (i)  experiment,  the  genotypic  distribution  for  the 
best  of  k  selected  phenotypes  was  cconputed  with  h  =  g  »  1.645,  x  «  2, 
and  then  compounded  with  the  distribution  of  k  for  N  =  196.  Earlier 
computations  had  shown  that  when  h  =  ^  and  x  =  2  in  a  type  (i)  experiment, 
a  population  size  of  N  =  196  is  required  for  90  percent  confidence  that 
at  least  one  selection  will  belong  to  the  top  5  percent  of  the  genotypic 
distribution.  Table  2,  on  the  other  hand,  indicates  that  in  a  type  (li) 
experiment  a  population  size  of  N  =  5300  is  required  for  90  percent 
confidence  that  the  phenotypically  best  selection  will  belong  to  the  top 
5  percent  of  the  genotypic  distribution. 

Superficially,  these  two  tabulated  results  might  appear  incompatible, 
however  the  conditions  to  be  fulfilled  are  quite  different.  In  the  one 
case  we  require  only  that  at  least  one  of  the  selections  be  genetically 
superior,  while  in  the  other  case  we  require  that  an  identifiable  one 
(the  largest)  of  the  selections  be  genetically  superior.  Clearly,  the 
latter  requirement  is  much  more  stringent  and  the  population  size 
necessary  to  achieve  it  is  correspondingly  much  greater.  For  a  population 
size  of  only  196  in  a  type  (i)  experiment  with  h  =  -^  and  x  *  2  there  is  a 
0.9  probability  that  at  least  one  of  the  selections  will  have  g  >  1.645, 
but  there  is  (by  numerical  integration)  less  than  0.65  probability  that  the 
phenotypic  ally  best  selection  idLll  have  g  >  1.645. 

In  a  type  (ii)  experiment,  phenotypic  ordering  is  an  integral  part 
of  the  selection  process  and  thus  contributes  no  additional  information 
beyond  that  assumed  in  the  planning  stage.  In  this  case  the  number  (k) 
selected  is  fixed  in  advance,  and  the  minimum  selected  phenotype  x^  is  a 
chance  variable  corresponding  to  the  selection  point  x  of  a  type  (l) 


experiment.  In  fact,  if  were  actxxally  measured  while  the  remaining 
top  k-1  phenotypes  were  merely  ranked  then  the  information  on  these  k»l 
individuals  would  be  of  a  type  identical  to  that  obtained  on  the  k 
selections  of  a  type  (i)  experiment,  with  now  playing  the  role  of  x. 

Most  commonly,  all  selected  phenotypes  will  be  measured,  so  that 
the  outcome  of  the  selection  experiment  consists  of  phenotypic  observations 
Xj  >  Xg  >  >x,£,  and  Figures  l-J  then  suffice  to  describe  the  probability 

distribution  of  genotypic  values  associated  wi'tii  each  of  these  phenotypes, 
regardless  of  which  type  of  experiment  was  employed. 


Expected  Identifiable  Numbers  of  Genetic  Deviates 


Powers  Cl9^5)^  Powers  et  al*,  (1958)  and  Dudley  and  Powers  (1959) 
introduced  the  concept  of  identifiable  numbers  of  genetic  deviates  in  the 
phenotypic  classes  of  a  segregating  population.  The  identifiable  numbers 
of  genetic  deviates  are  represented  by  the  differences  between  these  class 
frequencies  and  those  of  a  nonsegregating  population  of  the  same  size  super¬ 
imposed  on  the  same  population  mean.  As  noted  by  Pederer,  Powers,  and  Payne 
(in  process  of  publication)  in  the  normal  case  the  class  frequencies  of  the 
segregating  population  exceed  those  of  the  corresponding  nonsegregatln^ 
population  at  a  distance  of 


z  =  K' 


flog, (1-h) 

IS 


or  more  on  either  side  of  the  mean.  Consequently,  in  the  sense  defined  by 
Powers,  the  frequency  of  identifiable  genetic  deviates  is  positive  in  any 
phenotypic  class  beginning  at  least  a  distance  z  away  from  the  mean  of  an 
infinite  population.  In  particular,  in  the  entire  tail  of  the  distribution 
from  z  to  +OJ  the  frequency  of  identifiable  genetic  deviates  is 

Tl^bi 


.  /log,  (l«h)  /  (l-h)log, 

^  -  <y — ^ 


-io  that  in  a  segregating  population  of  size  H  the  expected  ntaaber  of  superior 
Identifiable  genetic  deviates  is  defined  as  NT*. 


■r-' 


<Jnfortunately,  in  the  present  context  this  definition  raises  certain 
ambiguities,  for  in  this  same  phenotypic  interval  (z,  »)  the  proportion  of 
genotypic  values  exceeding  the  mean  Is 


so  that  K  Po,,t  (l-$(z*))  also  be  described  as  the  ejqjected  number  of 
"superior”  genotypes  falling  in  this  interval.  Confusion  may  be  avoided 
here  by  regarding  I*  as  an  index  of  heritability,  comparable  to  the  herit- 
ability  ratio  h,  rather  than  attempting  to  interpret  I*  as  a  probability; 
in  fact,  for  the  nommil  case,  F*  is  a  monotone  (increasing)  function  of  h 
and  is  therefore  equivalent  to  h  as  a  heritability  index.  When  the  genotypic 
distribution  is  non-normal  then,  of  course,  the  two  indices  P*  and  h  are  no 
longer  equivalent  and,  in  general,  neither  can  be  regarded  as  an  adequate 
index  of  heritability  in  the  sense  of  uniquely  determining  the  genotypic 
distribution  for  a  given  phenotypic  distribution.  If  a  frequency  difference 
is  computed  for  every  phenotypic  class  interval,  however,  all  of  the  in¬ 
formation  in  the  genetic  experiment  is  retained  so  that  in  this  extreme 
if  trivial  form  the  F*  index  has  optimum  properties  for  any  distribution 
model. 


Numerical  Illustrations  With  Sugarbeet  Data 

Experimental  data  fulfilling  all  of  the  requirements  of  the  normal 
model  are  extremely  uncommon,  and  in  the  strictest  sense  are  actually  non¬ 
existent,  Aside  from  the  problem  of  achieving  normality  and  of  achieving 
constant  environmental  variance  by  appropriate  choices  of  scale  of  measure¬ 
ment,  there  is  also  the  practical  problem  of  conducting  a  completely 
randomized  experiment  with  a  single  individual  per  plot  as  called  for  by 
the  simple  model  considered  here.  Hie  data  used  here  for  illustrative 
purposes,  obtained  from  a  population  genetic  study  on  sugarbects  conducted 
in  i960,  cannot  be  rigorously  shown  to  satisfy  any  of  the  requirements  of 
the  model,  and  so  the  application  of  the  model  provides  only  a  crude  guide 
to  the  true  genetic  character  of  the  data.  Until  more  general  methods  of 


genetic  analysis  are  developed,  however,  a  crude  guide  is  eill  that  can  be 
ejected  and  may  serve  as  a  valuable  tool  in  the  analyses  of  such  data* 

For  details  of  the  design  of  the  experiment,  a  description  of  the 
populations  studied,  and  the  adjustment  of  the  frequency  distributions  to 
eliminate  variation  due  to  replications  and  populations  see  Powers  et  eil., 
(in  process  of  publication)*  With  the  exception  of  the  predicted  values 
the  data  in  table  5  were  taken  from  this  article.  They  determined  sucrose 
percentage  of  the  sijgarbeet  roots,  transformed  the  data  to  the  logarithmic 
scale,  and  made  the  adjustments  mentioned  above.  The  resulting  frequency 
distributions  are  thus  freed  of  replication  and  population  mean  effects 
and  hence  are  subject  only  to  within  plot  sources  of  genetic  and  environ¬ 
mental  variation*  Intra  plot  correlations,  positive  before  adjustment, 
are  negative  in  the  adjusted  observations*  Their  effects  are  not  taken 
into  account  in  the  following  analjreis.  The  frequency  distribution  (sea 
Powers  et  eil*,  1958  and  Powers  et  al*,  in  process  of  publication)  present  a 
skew  appearance  indicating  that  the  environmental  distribution  depicted  by 
the  nonsegregating  entry  may  change  shape  with  each  genotype* 

First  the  correspondence  between  the  observed  frequency  difference 
beyond  the  points  of  intersection  of  the  segregating  and  the  nonsegregating 
frequency  distributions  are  considered.  The  method  of  identifying  these 
points  of  intersection  is  given  by  Powers  et  al*,  (1958).  The  predicted 
frequency  differences,  based  on  the  obsezved  within  plot  beritability 
ratio  and  the  normal  theory  formula  for  MF*‘  are  shown  in  table  5  for  each 
of  the  segregating  populations*  For  example,  with  a  beritability  ratio 
of  h  =  ,60569  and  H  =  ^50  the  predicted  frequency  difference  to  the  right  of 

-y - :::sm - 

13  .  450  [s  -  *  (.77636)] 

=  450  [.8924  -  .7808]  =  50 

Examination  of  table  3  reveals  that  this  prediction  corresponds  fairly 
closely  with  observation  for  most  entries,  though  a  few  major  discrepancies 
reduce  the  correlation  between  observed  and  predicted  to  Ow?.  A  somewhat 
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Table  3  Comparison  of  observed  and  predicted  identifiable  numbers  of 

genetic  deviates*  i960  Group  I,  log  percentage  sucrose,  N  =*  450.  ^ 


Population  and  entry  Heritability  Identifiable  numbers  of  genetic  deviates 


ratio 

h 

■Superior  . 

observed  predicted 

.  Total _ - 

observed  jxredlcted 

CMS  X  4w-34,  1 

.60569 

45 

50 

87 

100 

CMS  X  A54-1,  2 

.62211 

52 

52 

97 

104 

A54-1,  3 

.63228 

60 

53 

108 

106 

CMS  X  4W-34  Sg,  4 

.1*8490 

42 

35 

75 

70 

4W-34  %,  5 

.59884 

45 

49 

77 

97 

CMS  X  4w-34  asexual, 
recurrent,  6 

.52778 

44 

4o 

82 

81 

4w-34  asexual, 
rectirrent,  7 

,41712 

25 

29 

52 

58 

52-430  X  54-520  Fj ,  8 

,41019 

32 

29 

64 

57 

54-520  X  52-505  Fj ,  10 

•35738 

33 

24 

64 

48 

A56-5,  11 

.51308 

56 

39 

94 

78 

54-520,  12 

.67032 

44 

59 

83 

118 

Total 

478 

459 

883 

917 

ss. 

21864 

20499 

73541 

6184? 

SCP 

20794 

76459 

C.T. 

20771.27 

19152.82 

70880.82 

76444,45 

19945.64 

73610.09 

1092,73  1346,18 

2660.18  5402.55 

848.36 

2848.91 

r  =  .6995 

r  =  ,7515 

^  V/lth  exception  of  the  predicted  values  the  data  are  taken  from  Powers, 
Renanenga,  and  Urquliart  (in  process  of  publication). 


-19- 


surprlslng  side  result  is  the  high  correlation  of  *99976  between  h  and 
HI*  in  this  table,  indicating  that  despite  this  analytically  complicated 
form,  HI*  is  a  nearly  linear  function  of  h.  In  li^t  of  these  many  clear 
and  suspected  violations  of  the  model,  the  degree  of  fit  to  the  model 
predictions  are  somewhat  surprising,  and  lend  some  credence  to  those  assump¬ 
tions  which  cannot  be  directly  checked* 

Another  direct  measure  of  departure  from  normality  is  given  in  table  4, 
where  the  observed  frequencies  in  the  upper  tails  of  the  distributions 
(X  >  X*)  are  compared  to  predicted  frequencies  (N(1-S(x*)))o  Here  the 
skewness  of  the  phenotypic  distributions  is  made  more  apparent,  the  predicted 
upper  tail  frequencies  almost  always  exceeding  the  observed*  The  a  poster¬ 
iori  probability  of  a  superior  genetic  deviate  in  the  tail,  computed  by 
interpolation  from  figures  4-12,  would  therefore  appear  to  overestimate  the 
true  probability  and  hence  have  been  adjusted  downward*  For  example,  in  the 
region  X  >  g  +  1,2667  +  cf  of  a  normal  segregating  population  with 

h  a  ,60569  the  proportion  of  genotypes  falling  in  the  first  quartile  of  the 
(normal)  genotypic  distribution  is  given  by  linear  interpolation  between 
the  points  =  25,  x  =  1,2667  in  figures  9  Q»d  10.  For  h  «  .6  in  figure  9 
at  X  1,2667  and  =  25  we  find  P^^j,  =  ,825  and  for  h  »  ,7  in  figure  10, 

P, =  .887»  Interpolation  to  h  =  ,60569  then  gives 

Pj.x  =  .60569  (.887  -  .825)  +  .825  =  .328 

and  since  52  individuals  instead  of  the  predicted  46  belonged  to  this  tail 
of  the  observed  (standardized)  phenotypic  distribution  then  the  predicted 
number  of  genotypes 

52  Pj^jt  =  52  (.828)  =  26 

was  arbitrarily  adjusted  downward  by  the  factor  52/46,  giving 


Table  Normal  theory  predicted  proportions  of  superior  genetic  deviates  based  on  adjuslanent  5,  combining 
Group  I,  N  =  450,  D.  F.  =  420. 
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Appendix 

If  X  a  G  +  E  where  G  and  E  are  Independent  normal  chance  variables  with 
means  g  and  0  and  variances  and  of,  respectively,  then  the  conditional' 
distribution  of  Q  for  fixed  X  is  normal  with 


ave  (Glx)  a  ave  (g)  +  [x-ave  (x)] 

c? 

(x-i) 


=  g  + 


of + 


var  (0|x)  =  (0)  [l  - 


Consequently,  for 


X*  a 


of  +  cf  /oj=  + 


fi  "" — s  ”  0, 


the  conditional  distribution  of  G  is  expressible  as 


(1) 


p;ocg|x)  =  p  <  £--■  m.  .(gNii 

Vvar(G(x)  /vear(G|x) 

.  5  „  1 .  a  r^i 

vfi-h)  y(-L^h.s 


where  2  is  the  standard  (cumulative)  normal  distribution  function* 
Similarly,  under  the  condition  X>x 

CO 

l-P,„=P(G<g|X>jt)  »  J  P(G  <6ly)dP(X<y) 


(2) 


-2A- 

For  the  special  case  g  »  g  and  h  **  yii(l  -  h)  =  this  expression 
reduces  to 

CQ 

P(G  <i[X>x)  =  J  3  (-y)  d5  (y) 

Z  * 

00 

”  ^  I  *  (y)  4  *  (y) 

x» 


=  1  -  I  [1  +  S(x»)] 

while  (l)  beccanes  simply 

P(G<g[x)  =  1  -  5(x») 

IfX„<  •••  <Xj  d3note  the  ranked  observations  in  a  random  sample  of 
size  N  then  the  distribution  of  the  G»  component  of  is  given  by 

00 

P(Gk<g)  =  Jp(G<6|x)  dP(Xj.<x) 


oo  _ 

=  k  J  5  (“)  [3(x»)?-''Cl-3(x»)?-^d3Cx‘) 


/(l^T 

For  the  special  case  g  =  g  and  h  =i  ^  this  gives 

00 

F(Gk<g)  =  k  J  [1  -  ai(x»)]  (“)  [5(x»)?*ni  -  SCxMp-^dSCx*) 


N  +  1 


BO  the  expected  number  of  G-  components  less  than  g  among  the  k  largest 
observations  X,j  <  ..*  <Xj  is  the  sum 

V  -J—  - 

L  +  1  2  (N  +  l) 


then 

P(,.  =  X)=  ^(y)  =  p.,. 

and 

p(  IT,  =  t)» 

In  particular. 


p(i;T,  ai)=i-  a-p.^.)« 


O.Cl  0,05  0.1  C,2  0,5  I  :  5  10  OC  50  40  00  (.0  IS  E.i  SO  95  98  99  99i  99.3  99.99 

GENOTYPIC  DISTRIBUTION  FOR  SELECTIONS  AT  THE  95%  POINT  OF  THE  PHENOTYPIC  DISTRIBUTION 


6  -  31VlA3a  OI13N30  ayvQNvis 


GENOTYPIC  DISTRIBUTION  FOR  SELECTIONS  AT  THE 


PERCENTAGE  BELONGING  TO  THE  UPPER  P-  PROPORTION  OF  THE  GENETIC  POPULATION 


PHENOTYPIC  SELECTION  POINT  -x 


SELECTION  RATE  -100  I- |(x)  percent 


PERCENTAGE  BELONGING  TO  THE  UPPER  P- PROPORTION  OF  THE  GENETIC  POPULATION 


PHENOTYPIC  SELECTION  POINT-  x 


PERCENTAGE  BELONGING  TO  THE  UPPER  P-  PROPORTION  OF  THE  GENETIC  POPULATION 


PHENOTYPIC  SELECTION  POINT-  x 


93;99 _  39,9  99, S 


sss 


BB^S^W^b^BSh  I 

*iflMi^9RHHHHiHHHi  HI^HlB^Si^H^B 

H[H|i 


■U1:  -fe 

o  Ol 


U  CjJ 

ui  o 


SELECTION  RATE  -  iOo|T-$l*)J  percent 


0.25 


PERCENTAGE  BELONGING  TO  THE  UPPER  P- PROPORTION  OF  THE  GENETIC  POPULATION 


0. 25 


1.75 


PERCENTAGE  BELONGING  TO  THE  UPPER  P-  PROPORTION  OF  THE  GENETIC  POPULATION 


_ 99.9  99.J 


9S  0 


PHENOTYPIC  CLASS  FREQUENCY  -  percent 


CORNELL  UNIVERSITY  -  BIOMETRICS  UNIT 

Distribution  List  for  Unclassified  Technical  Reports 

Contract  Nonr-40l(39) 

Project  (NR  01^2-212) 


Head,  Logistics  and  Ms.thenia- 
tical  Statistics  Breuich 
Office  of  Naval  Research 
Washington  25,  D,  Ct  3 

Commanding  Officer 
Office  of  Naval  Research 
3^6  Broadway 

New  York  13,  New  York  1 

AS-.fIA  Document  Service  Center 

Arlington  Hall  Station 

Arlington  12,  Virginia  10 

Institute  for  Defense  Analyses 
Communications  Research  Division 
von  Neumann  Hall 

Princeton,  New  Jersey  1 

Technical  Information  Officer 
Naval  Research  Laboratory 
Washington  25,  D.  C.  6 


Professor  T.  W.  Anderson 
Department  of  Mathematical 
Statistics 
Columbia  University 
New  York  27,  New  York  1 

Professor  Z.  W.  Birnbaum 
Laboratory  of  Statistical  Research 
Department  of  Mathematics 
University  of  V/ashington 
Seattle  5,  Washington  1 

Professor  A.  V,  Bowker 
Applied  Mathematics  and 
Statistics  Laboratory 
Stanford  University 
Stanford,  California  1 

Professor  Ralph  A.  Bradley 
Departnent  of  Statistics 
Floridfi  State  University 
Tallahassee,  .Florida  1 

Dr.  Jeto  W.  Cell 
Department  of  Mathematics 
North  Carolina  State  College 
Raleigh,  North  Ga'.’olina  1 


Professor  Herman  Chernoff 
Applied  l-Iathematics  and 
Statistics  Laboratory 
Stanford  University 
Stanford,  California  1 

Professor  W,  G«  Cochran 
Department  of  Statistics 
Hea^vard  University 

Cambridge,  Massachusetts  1 

Dr.  C,  Clark  Cockerham 
Institute  of  Statistics 
North  Carolina  State  College 
Raleigh,  North  Carolina  1 

Professor  Cyrus  Derman 

Dept,  of  Industrial  Engineering 

Columbia  University 

New  York  27,  New  York  1 

Professor  Benjamin  Epstein 
Applied  Mathematics  and 
Statistics  Laboratory 
Stanford  University 
Stanford,  California  1 

Dr.  W,  T.  Federer 
Biometrics  Unit 
Plant  Breeding  Department 
Cornell  University 

Ithaca,  New  York  1 

Dr.  R.  J.  Freund 
Department  of  Statistics  and 
Statistical  Laboratory 
Virginia  Polytechnic  Institute 
Blacksburg,  Virginia  1 

Professor  H.  P.  Goode 
Dept,  of  Industrial  and 
Engineering  Administration 
Cornell  University 

Ithaca,  New  York  1 

Professor  W,  Hlrsch 

Institute  of  Mathematical  Sciences 

New  York  3,  Kew  York  1 


-2. 


Professor  Harold  Hotelling, 

Associate  Director 
Institute  of  Statistics 
University  of  North  Carolina 
Chapel  Hill,  North  Carolina  1 

Professor  Oscar  Kenrpthorne 

Statistics  laboratory 

Iowa  State  University 

Ames,  Iowa  1 

Professor  Gerald  J.  Lieberman 
Applied  Mathematics  and 
Statistics  Laboratory 
Stanford  University 
Stanford,  California  1 

Dr.  Arthur  E.  Mace 
Eattelle  Memorial  Institute 
505  King  Avenue 

Colxirob  s  1,  Ohio  1 

Professor  J.  Neyman 
Department  of  Statistics 
University  of  California 
Berkeley  h,  California  1 

Dr.  F.  Oberhettinger 

Department  of  Mathematics 

Oregon  State  College 

Corvallis,  Oregon  1 

Professor  Herbert  Robbins 

Mathematical  Statistics  Dept, 

Fayerweather  Hall 

Columbia  University 

New  York  27,  New  York  1 

Professor  Murray  Rosenblatt 
Department  of  {'la thematic  s 
Brovm  University 

Providence  12,  Rhode  Island  1 

Professor  H.  Rubin 
Deportment  of  Statistics 
Mlchlgr-n  State  University 
East  Lansing,  Michigan 


Professor  I.  R.  Savage 

School  of  Business  Administration 

University  of  Minnesota 

Minneapolis,  Minnesota  1 

Professor  L,  J,  Savage 
Statistical  Research  Laboratory 
Chicago  University 

Chicago  37,  Illinois  1 

Professor  W,  L.  Smith 
Statistics  Department 
University  of  North  Carolina 
Chapel  Hill,  North  Carolina  1 

Professor  Frank  Spitzer 
Department  of  Mathematics 
University  of  Minnesota 
Minneapolis,  Minnesota  1 

Dr,  H,  Teicher 
Statistical  Laboratory 
Engineering  Administration  Building 
Purdue  University 

Lafayette,  Indiana  1 

Professor  M.  B,  Wilk 

Statistics  Center 

Rutgers  -  The  State  University 

New  Brunswick,  New  Jersey  1 

Professor  S,  S.  V/ilks 

Department  of  Mathematics 

Princeton  University 

Princeton,  New  Jersey  1 

Professor  J.  Wolfowitz 
Department  of  Mathematics 
\^ite  Hall 
Cornell  University 

Ithaca,  New  York  1 


1 


